Using Mendelian inheritance to improve high-throughput SNP discovery.

Authors

  • Nancy Chen
  • Cristopher V Van Hout
  • Srikanth Gottipati
  • Andrew G Clark
Abstract

Restriction site-associated DNA sequencing or genotyping-by-sequencing (GBS) approaches allow for rapid and cost-effective discovery and genotyping of thousands of single-nucleotide polymorphisms (SNPs) in multiple individuals. However, rigorous quality control practices are needed to avoid high levels of error and bias with these reduced representation methods. We developed a formal statistical framework for filtering spurious loci, using Mendelian inheritance patterns in nuclear families, that accommodates variable-quality genotype calls and missing data--both rampant issues with GBS data--and for identifying sex-linked SNPs. Simulations predict excellent performance of both the Mendelian filter and the sex-linkage assignment under a variety of conditions. We further evaluate our method by applying it to real GBS data and validating a subset of high-quality SNPs. These results demonstrate that our metric of Mendelian inheritance is a powerful quality filter for GBS loci that is complementary to standard coverage and Hardy-Weinberg filters. The described method, implemented in the software MendelChecker, will improve quality control during SNP discovery in nonmodel as well as model organisms.
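To illustrate the idea of a Mendelian filter that tolerates uncertain and missing genotype calls, the following Python sketch scores one father-mother-child trio at a single biallelic autosomal locus. It is not MendelChecker's algorithm or interface. The function names, the use of per-individual genotype probabilities, and the uniform distribution substituted for missing calls are all assumptions made for this example.

```python
from itertools import product

# Genotypes are coded as the number of alternate alleles: 0, 1 or 2.
# Probability that a parent with a given genotype transmits the alternate allele.
TRANSMIT_ALT = {0: 0.0, 1: 0.5, 2: 1.0}

# Assumption for this sketch: a missing call is treated as completely uninformative.
UNIFORM = (1 / 3, 1 / 3, 1 / 3)


def child_prob(f, m, c):
    """Mendelian probability of child genotype c given parental genotypes f and m."""
    pf, pm = TRANSMIT_ALT[f], TRANSMIT_ALT[m]
    if c == 0:
        return (1 - pf) * (1 - pm)
    if c == 1:
        return pf * (1 - pm) + (1 - pf) * pm
    return pf * pm


def trio_consistency(father, mother, child):
    """Probability that a trio's true genotypes are Mendelian-compatible.

    Each argument is a 3-tuple of genotype probabilities (summing to 1),
    or None when the individual has no call at this locus.
    """
    father = father or UNIFORM
    mother = mother or UNIFORM
    child = child or UNIFORM
    total = 0.0
    for f, m, c in product(range(3), repeat=3):
        # Count only genotype combinations that Mendelian inheritance allows.
        if child_prob(f, m, c) > 0:
            total += father[f] * mother[m] * child[c]
    return total


if __name__ == "__main__":
    hom_ref = (1.0, 0.0, 0.0)
    hom_alt = (0.0, 0.0, 1.0)
    print(trio_consistency(hom_ref, hom_ref, hom_ref))  # compatible trio
    print(trio_consistency(hom_ref, hom_ref, hom_alt))  # Mendelian violation
    print(trio_consistency(hom_ref, hom_ref, None))     # child call missing
```

In a filter built along these lines, a locus would be scored by combining such values across all families, and loci with low scores would be flagged as likely spurious. The aggregation rule and any threshold are left open here because the abstract does not specify them.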

Similar resources

Performance comparison of SNP detection tools with illumina exome sequencing data—an assessment using both family pedigree information and sample-matched SNP array data

To apply exome-seq-derived variants in the clinical setting, there is an urgent need to identify the best variant caller(s) from a large collection of available options. We have used an Illumina exome-seq dataset as a benchmark, with two validation scenarios--family pedigree information and SNP array data for the same samples, permitting global high-throughput cross-validation, to evaluate the ...

SNPP: automating large-scale SNP genotype data management

To manage high-throughput single nucleotide polymorphism (SNP) genotyping data efficiently, we developed a dynamic general database management system, SNPP (SNP Processor). It provides several functions, including data importing with comparison, Mendelian inheritance check within pedigrees, data compiling and exporting. Furthermore, SNPP may generate files for repeat genotyping and tr...

High-throughput targeted SNP discovery using Next Generation Sequencing (NGS) in few selected candidate genes in Eucalyptus camaldulensis

Background: The present era of high-throughput technologies offers immense promise and innovative applications for SNP discovery and high-quality parallel genotyping [1,2]. Using advancements in next generation sequencing (NGS) technologies, en masse SNP discovery for targeted genomic regions is possible for eucalypts. The river red gum or Eucalyptus camaldulensis (Ec) is a fast growing, ...

Visualization of uniparental inheritance, Mendelian inconsistencies, deletions, and parent of origin effects in single nucleotide polymorphism trio data with SNPtrio.

A variety of alterations occur in chromosomal DNA, many of which can be detected using high density single nucleotide polymorphism (SNP) microarrays. These include deletions and duplications (assessed by observing changes in copy number) and regions of homozygosity. The analysis of SNP data from trios can provide an additional category of information about the nature and origin of inheritance p...

Whole genome association studies of neuropsychiatric disease: An emerging era of collaborative genetic discovery

Family history, which includes both common environmental and genetic effects, is associated with an increased risk for many neuropsychiatric diseases. Investigators have identified several disease-causing mutations for specific neuropsychiatric disorders that display Mendelian segregation. Such discoveries can lead to more rational drug design and improved intervention from a better understandi...

Journal:
  • Genetics

Volume 198, Issue 3

Pages: -

Publication date: 2014